Using real words for recording diphones

نویسنده

  • Susan Fitt
چکیده

This paper focuses on the creation of word-lists for making diphone recordings for speech synthesis. Such lists often consist of nonsense words, which has the advantage that the phonetic environment can be constrained, and it is easy to produce lists containing all possible combinations. However, this approach has the disadvantage that non-experts may find it difficult to read the nonsense-word transcriptions. For this reason, we investigate here the issues associated with the use of real words in creating diphone recordings.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Increased Diphone Recognition for an Afrikaans TTS system

In this paper we discuss the implementation of an Afrikaans TTS system that is based on diphones. Using diphones makes the system flexible but presents other challenges. A previous effort to design an Afrikaans TTS system was done by SUN. They implemented a TTS system based on full words. A full word based TTS system produces more natural sounding speech than when the system is designed using o...

متن کامل

Pitch-effects in diphone recording: are logatomes inappropriate?

The most obvious difference between recordings of German words and non-sense words (logatomes) within the BITS project was an audibly noticeable difference in pitch: diphones obtained from logatomes seemed to have a higher pitch than those from words. We proved this by measuring the pitches of diphones concatenated to sentences, and by comparing the pitches of actually uttered logatomes with pi...

متن کامل

Tuning Limited Domain Speech Synthesis Using General TTS System

The subject of the present paper is the building of a limited domain speech synthesis system, where longer units, like words and phrases, can naturally be concatenated together. However, instead of building a single-purpose domainoriented engine working with longer units, we show that a general-purpose TTS system can be used as a good emulation tool to ensure that a real domain-oriented engine ...

متن کامل

Real-Time Performance Controllers for Synthesized Singing

A wide variety of singing synthesis models and methods exist, but there are remarkably few real-time controllers for these models. This paper describes a variety of devices developed over the last few years for controlling singing synthesis models implemented in the Synthesis Toolkit in C++ (STK), Max/MSP, and ChucK. All of the controllers share some common features, such as air-pressure sensin...

متن کامل

A database design for a TTS synthesis system using lexical diphones

Database designs, if based on the premise that there are about 2000 diphones in English, as stated in many publications and on-line documents, are likely to render a database of diphones, which will fail to capture some important phonological phenomena of English. This paper proposes a TTS database, which is built from diphones inclusive of their syllabic stress; we term these units lexical dip...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2001